PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Neem_628_f_5
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Sapindales; Meliaceae; Azadirachta
Family HD-ZIP
Protein Properties Length: 757aa    MW: 83738.6 Da    PI: 6.2294
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Neem_628_f_5genomeNGDView Nucleic Acid
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox67.22.2e-21103158156
                   TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
      Homeobox   1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                   r+k +++t+eq++e+e+lF+++++p++++r++L+k+lgL  rqVk+WFqNrR++ k
  Neem_628_f_5 103 RKKYHRHTAEQIREMEALFKESPHPDEKQRQQLSKQLGLAPRQVKFWFQNRRTQIK 158
                   7999************************************************9877 PP

2START229.59.4e-722784967206
                   HHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS.......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S....EEEEEEEECTT. CS
         START   7 aqelvkkalaeepgWvkss....esengdevlqkfeeskv......dsgealrasgvvdmvlallveellddkeqWdetla....kaetlevissg. 88 
                    +el k+a+a+ep+W +s+    e++n+de++++f+ ++       +s+ea+r++gvv+ +l++lv++++d++ qW+ +++    ka+t++vi+sg 
  Neem_628_f_5 278 IEELKKMATAGEPLWIRSVetgrEILNYDEYIKEFSVENPsngkpkRSIEASRETGVVFVDLPKLVQSFMDVN-QWKAMFPclisKAATVDVICSGe 373
                   67999******************************88777999******************************.*********************** PP

                   .....EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEEEEE-EE--SSXXHH CS
         START  89 .....galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskvtwvehvdlkgrlphw 179
                        ga+qlm+aelq+l+p+vp R+++fvRy++ql+a++w+ivdvS+d  +++  ++s+v+++++pSg++ie+ksngh+kv+wveh +++++++h 
  Neem_628_f_5 374 ganrnGAVQLMFAELQMLTPMVPtREVYFVRYCKQLSAEQWAIVDVSIDKVEENI-DASLVKCRKRPSGCIIEDKSNGHCKVIWVEHLECQKATVHT 469
                   *****************************************************98.9**************************************** PP

                   HHHHHHHHHHHHHHHHHHHHTXXXXXX CS
         START 180 llrslvksglaegaktwvatlqrqcek 206
                   ++rs+v+sgla+ga++w+atlq qce+
  Neem_628_f_5 470 MYRSIVNSGLAFGARHWMATLQLQCER 496
                   *************************97 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:1.10.10.609.4E-2488154IPR009057Homeodomain-like
SuperFamilySSF466898.77E-2190161IPR009057Homeodomain-like
PROSITE profilePS5007118.22100160IPR001356Homeobox domain
SMARTSM003898.3E-19102164IPR001356Homeobox domain
PfamPF000461.0E-18103158IPR001356Homeobox domain
CDDcd000865.92E-17107158No hitNo description
PROSITE patternPS000270135158IPR017970Homeobox, conserved site
PROSITE profilePS5084838.497263499IPR002913START domain
SuperFamilySSF559611.92E-32265496No hitNo description
CDDcd088752.53E-111267495No hitNo description
SMARTSM002341.7E-72272496IPR002913START domain
PfamPF018524.4E-57278496IPR002913START domain
Gene3DG3DSA:3.30.530.204.2E-6323491IPR023393START-like domain
SuperFamilySSF559611.3E-15525744No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0009957Biological Processepidermal cell fate specification
GO:0010062Biological Processnegative regulation of trichoblast fate specification
GO:0005634Cellular Componentnucleus
GO:0008289Molecular Functionlipid binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 757 aa     Download sequence    Send to blast
MGVDMSNNPP TSRTKDFFAS PALSLSLAGI FRDAGAAAAS AAEANNEVEE GDEGSGGGGS  60
RREETLEISS ENSGPGRSRS DDEFDGGEHD DDEDGDKNKK KKRKKYHRHT AEQIREMEAL  120
FKESPHPDEK QRQQLSKQLG LAPRQVKFWF QNRRTQIKAI QERHENSLLK AEMEKLRDEN  180
KAMRETINKA CCPNCGMATT SRDTTVTTEE QQLRIENAKL KAEVEKLRAA AGKCPPGGTS  240
TSSCSAANDQ ENRSSLDFYT GIFGLEKSRI TELVNQGIEE LKKMATAGEP LWIRSVETGR  300
EILNYDEYIK EFSVENPSNG KPKRSIEASR ETGVVFVDLP KLVQSFMDVN QWKAMFPCLI  360
SKAATVDVIC SGEGANRNGA VQLMFAELQM LTPMVPTREV YFVRYCKQLS AEQWAIVDVS  420
IDKVEENIDA SLVKCRKRPS GCIIEDKSNG HCKVIWVEHL ECQKATVHTM YRSIVNSGLA  480
FGARHWMATL QLQCERLVFF MATNVPTKDS TGVATLAGRK SILKLAQRMT WNFCRAIAAS  540
SYHTWNKVAS KTGEDIRVSS RKNLNDPGEP HGVILCAVSS VWLPVSPHVL FDFLRDEAHR  600
NEWDIMSNGG PVQTIANLAK GQDRGNAVTI QAMKSKENSM WVLQDSCTNA YESMVIYAPV  660
DITGMQSVIT GCDSSNIAIL PSGFSILPDG LESRPLVITS RQEEKSTEGG SLLTIAFQIL  720
TNNSPTAKLT MESVESVNTL ISCTLQNIKT SLQCEDA
Nucleic Localization Signal ? help Back to Top
NLS
No. Start End Sequence
198103KKKKRK
298104KKKKRKK
3100104KKRKK
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankJX6002474e-89JX600247.1 Gossypium hirsutum clone NBRI_GE38006 microsatellite sequence.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_008228560.10.0PREDICTED: homeobox-leucine zipper protein GLABRA 2
SwissprotP466070.0HGL2_ARATH; Homeobox-leucine zipper protein GLABRA 2
TrEMBLM5WXG00.0M5WXG0_PRUPE; Uncharacterized protein
STRINGPOPTR_0003s05100.10.0(Populus trichocarpa)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MalvidsOGEM123702731
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT1G79840.10.0HD-ZIP family protein